Rank in Wordlist | Frequency | Word |
---|---|---|
8037 | 27 | 3,000 |
10631 | 20 | 1,500 |
11125 | 19 | 2,000 |
12919 | 16 | 4,000 |
14449 | 14 | 1,000 |
14459 | 14 | 5,000 |
17650 | 11 | 6,000 |
18327 | 11 | כ-3,000 |
19026 | 10 | 7,000 |
22544 | 8 | 1,200 |
Rank in Wordlist | Frequency | Word |
---|---|---|
7487 | 29 | 20% |
7488 | 29 | 30% |
7759 | 28 | 5% |
8032 | 27 | 10% |
8671 | 25 | ב-0.3% |
8995 | 24 | 25% |
9017 | 24 | ב-0.1% |
9361 | 23 | ב-0.2% |
10208 | 21 | ב-0.4% |
11662 | 18 | 15% |
Rank in Wordlist | Frequency | Word |
---|---|---|
15355 | 13 | S&P |
44525 | 3 | H&M |
56940 | 2 | B&O |
57137 | 2 | R&F |
57138 | 2 | R&R |
60994 | 2 | ה-P&C |
60999 | 2 | ה-S&P |
82731 | 1 | AT&T |
83314 | 1 | Gault&Millau |
83616 | 1 | M&M's |
Rank in Wordlist | Frequency | Word |
---|---|---|
2675 | 79 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
1291 | 149 | ג'יימס |
1575 | 126 | ג'ונסון |
1647 | 121 | הג'יהאד |
2443 | 86 | הג'יהאד האיסלאמי |
2506 | 84 | ג'ון |
3119 | 69 | אנג'לס |
3371 | 65 | מנצ'סטר |
3520 | 62 | ג'יי |
3583 | 61 | ג'ורג |
3776 | 59 | תורג'מן |
Rank in Wordlist | Frequency | Word |
---|---|---|
27888 | 6 | 1+1 |
44423 | 3 | 45+1 |
81396 | 1 | 26+6 |
81864 | 1 | 45+6 |
82588 | 1 | 90+1 |
82589 | 1 | 90+2 |
82590 | 1 | 90+4 |
83403 | 1 | Hot+Cool |
95661 | 1 | גרוש+2 |
109217 | 1 | וה-90+6 |
Rank in Wordlist | Frequency | Word |
---|---|---|
10364 | 21 | ו/או |
12273 | 17 | 2019/20 |
15344 | 13 | 24/7 |
20604 | 9 | 2017/18 |
22946 | 8 | גלבוע/גליל |
27889 | 6 | 1/2 |
31623 | 5 | 2015/16 |
31624 | 5 | 2023/24 |
32206 | 5 | בן/ת |
32622 | 5 | האקס/ית |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots